A Systematic Review of Fault Prediction approaches used in Software Engineering

نویسندگان

  • Sarah Beecham
  • Tracy Hall
  • David Bowes
  • Steve Counsell
  • David Gray
  • Sue Black
چکیده

BACKGROUND – The accurate prediction of where faults are likely to occur in code is important because it can help direct test effort, reduce costs and improve the quality of software. OBJECTIVE – To summarise and analyse the published fault prediction studies in order to identify approaches used to build, measure and validate the performance of fault prediction models. METHOD – A systematic literature review of fault prediction in 148 studies published from January 2000 to December 2009. Studies are classified in terms of their context, the variables and methods used to build models as well as how the performance of a model is measured and validated. RESULTS – An increasing number of studies use machine learning approaches to predict where faults are likely to occur in code. These use a wide variety of methods to build models. Fault prediction models are based mainly on static code metrics, change data and previous fault data. The performance of models is measured in a range of ways that makes cross comparison very complex. The external validation of models is rarely demonstrated. Models reporting very high performance measures (eg. over 90%) need to be treated with particular caution. CONCLUSION – The literature in this area is just beginning to mature with a small but growing number of studies appearing that transparently report models built using rigorous techniques; the performance of these models have been measured in credible ways. However, many studies do not present their methods clearly enough to enable the performance of their models to be convincingly demonstrated. A more standardised way of building and reporting on such performance is needed before potential model users can confidently evaluate these studies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Classifiers in Software Fault-Proneness Prediction

Reliability of software counts on its fault-prone modules. This means that the less software consists of fault-prone units the more we may trust it. Therefore, if we are able to predict the number of fault-prone modules of software, it will be possible to judge the software reliability. In predicting software fault-prone modules, one of the contributing features is software metric by which one ...

متن کامل

A Systematic Literature Review on Software Fault Prediction based on Qualitative and Quantitative Factors

The growing demand for higher operational effectiveness and reliability in industrial processes has resulted in a huge attention in fault detection techniques. Researcher and practitioners are remains concerned with correct prediction when developing systems. On the other hand the most popular research area is software fault or fault prediction. Software fault prediction has both security and f...

متن کامل

Software fault prediction metrics: A systematic literature review

Context: Software metrics may be used in fault prediction models to improve software quality by predict-

متن کامل

A Systematic Review of Fault Prediction Performance in Software Engineering

BACKGROUND – The accurate prediction of where faults are likely to occur in code is important since it can help direct test effort, reduce costs and improve the quality of software. As a consequence, many different fault prediction models have been developed and reported in the literature. However, there is no consensus on what constitutes effective fault prediction. OBJECTIVE – To investigate ...

متن کامل

Data-Driven Approaches to Improve the Quality of Clinical Processes: A Systematic Review

Background: Considering the emergence of electronic health records and their related technologies, an increasing attention is paid to data driven approaches like machine learning, data mining, and process mining. The aim of this paper was to identify and classify these approaches to enhance the quality of clinical processes. Methods: In order to determine the knowledge related to the research ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010